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As big data increasingly becomes a buzzword, Health In- 
formatics Research is diligently following the trends in this 
regard; book reviews of recent issues have introduced several 
books on ways to deal with and analyze data to enhance 
business [1-3]. Globally, big data initiatives have demon- 
strated interest in and attention towards the effective and 
efficient use of big data. Examples are the European Unions 
EUDAT, a major collaborative data infrastructure project in 
Europe [4] , and the National Institutes of Healths Big Data 
to Knowledge (BD2K) initiative for biomedical big data [5]. 
Another movement is National Consortium for Data Sci- 
ence (NCDS) in the United States that seeks to advance the 
application of data to solve challenging problems, create 
jobs, protect national security, and improve quality of life 
[6]. Especially, BD2K appears to be an interesting attempt as 
it would enable scientists to take advantage of the big data 
being generated by research communities in the biomedical 
fields [5]. 

Indeed, we live in a world where the academic society has 
experienced a cultural shift from claiming 'my-own data to 
actively sharing data and publication [5]. Hence, I want to 
add another book on data science, entitled, Data Smart: Us- 
ing Data Science to Transform Information into Insight. As the 
subtitle itself indicates, this book places great emphasis on 
data science. This resource is not a theory- and code-based 
heavy reading; rather, it can help readers utilize data as criti- 
cal insights for decision making. Whether you see yourself as 
book smart or street smart, this book will help you become 
data smart. 

The author, John W. Foreman, is the Chief Data Scientist for 
MailChimp.com, an email service powering subscriptions 
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for marketing campaigns. He has also worked with vari- 
ous organizations, such as the FBI, Department of Defense, 
Coca-Cola, and Intercontinental Hotels Group. Based on 
his background, Foreman uses examples and concepts from 
business; however, professionals working in healthcare will 
be able to apply this book to their fields as well. 
Foreman defined data science as "the transformation of data 
using mathematics and statistics into valuable insights, deci- 
sions, and products." Harvard Business Review published the 
article "Data Scientist: The Sexiest Job of the 21st Century," 
which claimed that data scientists are a new kind of breed [7] . 
In fact, the term 'data scientist' was introduced first in 2008. 
Data scientists continue to be in great need in this big data 
era. According to the article, "if 'sexy' means having rare 
qualities that are much in demand, data scientists are already 
there." 

The author of Data Smart aims to provide an introduction 
to the practice of data science in a comfortable and conver- 
sational manner, and I think that he has been successful. He 
wants his readers to replace their anxiety of data science with 
excitement and ideas on how to use data to the next level for 
business. This book does not talk about health data at all, but 
I certainly insist that readers will feel more confident about 
data science after reading up to the last page of this book. 

The first chapter is a short tutorial for the spreadsheet pro- 
gram, Microsoft Excel. Concepts and techniques are provid- 
ed with the familiar Excel for most of readers. After readers 
learn these techniques with Excel for hands-on exercises, the 
last chapter talks about the use of the programming language 
R, which is appropriate for data science aiming at scalability. 
Foreman provides sample analyses in R with the same data- 
sets and problems in previous chapters, thereby expanding 
the reader's understanding of how the earlier techniques 
work in R environment, which focuses on analytics com- 
pared with Excel. He also provides a list of reference books 
on R at the end of this chapter for someone who wants to 
learn more about R. 

This book consists of ten chapters that delve into the fol- 
lowing topics: 

♦ Cluster analysis 

♦ Nut graphs 

♦ k-means 

♦ Artificial intelligence 

♦ Regression 

♦ Ensemble models 

♦ Forecasting 

♦ Outlier detection 
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These topics are introduced with eye-catching chapter titles, 
for example, "Naive Bayes and the Incredible Lightness of 
Being an Idiot." In the book, Foreman associates data sci- 
ence with other terms, such as business analytics, operations 
research, business intelligence, competitive intelligence, data 
analysis and modeling, and knowledge extraction, and those 
techniques show a glimpse of data science. In addition, each 
chapter offers pertinent datasets that readers can use in their 
hands-on exercise. Graphics and screen captures are present- 
ed to help the reader keep up with the concepts and exercises 
that are introduced. 

As Foreman intended for this book to be an "introduction 
to the practice of data science in a comfortable and conver- 
sational way," with particular attention given to clarity over 
mathematical correctness, readers can even attempt this 
book as enjoyable reading while on vacation this summer. 
His Twitter handle is @John4man, and readers might want 
to follow him. And please do not forget to visit the pub- 
lisher's website to listen to his introduction of this book and 
download datasets corresponding to the chapters at http:// 
www.wiley.com/go/datasmart. 
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